A stacked sequential learning method for investigator name recognition from web-based medical articles

نویسندگان

  • Xiaoli Zhang
  • Jie Zou
  • Daniel X. Le
  • George R. Thoma
چکیده

“Investigator Names” is a newly required field in MEDLINE citations. It consists of personal names listed as members of corporate organizations in an article. Extracting investigator names automatically is necessary because of the increasing volume of articles reporting collaborative biomedical research in which a large number of investigators participate. In this paper, we present an SVM-based stacked sequential learning method in a novel application – recognizing named entities such as the first and last names of investigators from online medical journal articles. Stacked sequential learning is a meta-learning algorithm which can boost any base learner. It exploits contextual information by adding the predicted labels of the surrounding tokens as features. We apply this method to tag words in text paragraphs containing investigator names, and demonstrate that stacked sequential learning improves the performance of a nonsequential base learner such as an SVM classifier.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Problem-Based Learning Approach in Medical Education in Iran: A Systematic literature Review

Introduction: Problem solving skills are at the highest level of human cognition and are considered to be the most valuable educational goals. Due to the nature of the discipline, medical students need to upgrade their problem-solving skills. This article aimed to investigate the findings of studies on the use of problem-based Learning in medical education. Methods: In this systematic review, u...

متن کامل

Using WebQuest in Medical Education

Introduction: Today modern teaching and learning approaches in medical education have received considerable attention. This paper aims to introduce WebQuest as a new method of inquiry-based learning through the use of Internet. Also its application in medical sciences education in general, and especially nursing education is explained. Methods: To find articles related to the WebQuest topic, t...

متن کامل

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...

متن کامل

The Outcomes of Ethics Education to Medical Students Based on Moral Reasoning Models

Introduction: For years, the importance of medical ethics education in medical schools has been emphasized but there is no consensus over learning goals yet. This study aimed to investigate the learning outcomes of medical ethics education based on models of moral reasoning. Methods: This study is a review using proper keywords in databases such as Medline, Web of Science, Scoupus, and Eric li...

متن کامل

A New Method for Detecting Ships in Low Size and Low Contrast Marine Images: Using Deep Stacked Extreme Learning Machines

Detecting ships in marine images is an essential problem in maritime surveillance systems. Although several types of deep neural networks have almost ubiquitously used for this purpose, but the performance of such networks greatly drops when they are exposed to low size and low contrast images which have been captured by passive monitoring systems. On the other hand factors such as sea waves, c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010